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^"^ ■ Nominal Logic is a version of first-order logic with equality, name-binding, renaming 

#Vj ' swapping and freshness of names. Contrarily to higher-order logic, bindable names, called atoms, 

and instantiable variables are considered as distinct entities. Moreover, atoms are capturable by 

instantiations, breaking a fundamental principle of lambda-calculus. Despite these differences, 

r'~'') ■ nominal unification can be seen from a higher-order perspective. From this view, we show that 

nominal unification can be reduced to a particular fragment of higher-order unification problems: 

Higher-Order Pattern Unification. This reduction proves that nominal unification can be decided 

C^ , in quadratic deterministic time, using the linear algorithm for Higher-Order Pattern Unification. 

O 1 We also prove that the translation preserves most generality of unifiers. 
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Mathematical Logic — Lambda Calculus and Related Systems 

General Terms: Lambda Calculus, Nominal Logic, Automated Theorem Proving, Term Rewriting 
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O ' 1. INTRODUCTION 



Nominal Logic is a version of first-order many-sorted logic with equality and prim- 
itives for renaming via name-swapping, name-binding, and freshness of names. It 
is characterized by a syntactic distinction between atoms (that roughly correspond 
to the notion of bound variable) and variables (that would correspond to free vari- 



Jh I ables). Therefore, binders can only bind atoms, we can only instantiate variables, 

and atoms are not instantiable even if they are not bounded. It also provides 
a new- quantifier [Gabbay and Pitts 1999], to model name generation and local- 
ity. Nominal logic was introduced at the beginning of this decade by Gabbay and 
Pitts [Gabbay and Pitts 1999; Pitts 2001; Gabbay and Pitts 2001; Pitts 2003]. 
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2 • J. Levy and M. Villaret 

These first works have inspired a sequel of papers where bindings and freshness 
are introduced in other areas, like nominal algebra [Gabbay and Mathijssen 2006; 
2007; 2009], equational logic [Clouston and Pitts 2007], rewriting [Fernandez and 
Gabbay 2005; 2007], unification [Urban et al. 2003; 2004], and Prolog [Cheney and 
Urban 2004; Urban and Cheney 2005]. 

This paper is concerned with Nominal Unification, the problem of deciding if 
two nominal terms can be made a-equivalent by instantiating their variables by 
nominal terms. In this instantiation, variables are allowed to capture atoms. Ur- 
ban, Pitts and Gabbay [Urban et al. 2003; 2004] describe a sound and complete, 
but inefficient (exponential), algorithm for nominal unification. Fernandez and 
Gabbay [Fernandez and Gabbay 2005] extend this algorithm to deal with the new- 
quantifier and locality. Nominal Logic's equivariance property suggested to Ch- 
eney [Cheney 2005a] a stronger form of unification called equivariant unification. 
He proves that equivariant unification and matching are NP-hard problems. An- 
other variant of nominal unification is permissive unification, defined by Dowek, 
Gabbay and Mulligan [Dowek et al. 2009; 2010], that is also reducible to Higher- 
Order Pattern Unification. Calves and Fernandez describe in [Calves and Fernandez 
2007] a direct but exponential implementation of a nominal unification algorithm 
in Maude, and in [Calves and Fernandez 2008] a polynomial implementation, based 
on the use of a graph representation of terms, and a lazy propagation of swappings. 
In [Levy and Villaret 2008] we prove that Nominal Unification can be decided in 
quadratic time by reduction to Higher-Order Pattern Unification. The present pa- 
per is an extension of this preliminary paper, where we have simplified the reduction 
by removing freshness equations, and we have included the proof of some important 
properties of pattern unifiers. In particular, we prove that most general higher-order 
pattern unifiers can be written without using other bound- variable names than the 
ones used in the presentation of the unification problem. Moreover, we establish 
a precise correspondence between most general nominal unifiers and most general 
pattern unifiers. Sections 4, 6 and 8 are completely new in this extended version. 
Recently, Calves and Fernandez [Calves 2010], and ourself [Levy and Villaret 2010] 
have independently found direct quadratic nominal unification algorithms based on 
the Paterson and Wegman's linear first-order unification algorithm [Paterson and 
Wegman 1978]. 

The use of a-equivalence and binders in nominal logic immediately suggests to 
look at nominal unification from a higher-order perspective, the one that we adopt 
in this paper. Some intuitions about this relation were already roughly described 
by Urban, Pitts and Gabbay in [Urban et al. 2004]. Cheney [Cheney 2005b] reduces 
higher-order pattern unification to nominal unification (here we prove the opposite 
reduction) . 

The main benefit of nominal logic, compared to higher-order logic, is that it allows 
the use of binding and a-cquivalence without the other difficulties associated with 
the A-calculus. In particular, with respect to unification, we have that nominal 
unification is unitary (most general unifiers are unique) and decidable [Urban et al. 
2003; 2004], whereas higher-order unification is undecidable and infinitary [Lucchcsi 
1972; Goldfarb 1981; Levy 1998; Levy and Veanes 2000]. In this paper we fully 
develop the study of nominal unification from the higher-order logics' view. We 
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Nominal Unification from a Higher-Order Perspective • 3 

show that full higher-order unification is not needed, and Higher-order Pattern 
Unification sufBces to encode Nominal Unification. This subclass of problems was 
introduced by Miller [Miller 1991]. Contrarily to general higher-order unification, 
higher-order pattern unification is decidable and unitary [Miller 1991; Nipkow 1993] . 
Moreover, unifiability can be decided in linear time [Qian 1996]. All this will lead 
us to show how to reduce nominal unification to higher-order pattern unification, 
and to conclude its decidability in quadratic deterministic time. 

From a higher-order perspective, nominal unification can be seen as a variant of 
higher-order unification where: 

(1) variables are all first-order typed, and constants are of order at most three, 

(2) unification is performed modulo a-equi valence, instead of the usual a and /?- 
equivalence, 

(3) instances of variables are allowed to capture atoms, contrarily to the standard 
higher-order definition, and 

(4) apart from the usual equality predicate, we use a freshness predicate a^t with 
the intended meaning: atom a does not occur free in t. 

The third point is the key that makes nominal unification an interesting subject 
of research. Variable capture is always a trouble spot. Roughly speaking, the main 
idea of this paper is to translate atoms into bound variables, and variables into 
free variables with the list of atoms that they can capture as arguments. The first 
point will ensure that, since variables do not have parameters, after translation, the 
only arguments of free variables will be list of pairwise distinct bound variables, 
hence higher-order patterns. Moreover, since bound variables will be first-order 
typed, and constants third-order typed, the translated problems will be second- 
order patterns. The second point is not a difficulty. Since all nominal variables 
are first-order typed, their instantiation does not introduce /3-redexes. Finally, the 
fourth point can also be overcome by translating freshness equations into equality 
equations, as described in Section 4. 

The remainder of the paper proceeds as follows. After some preliminaries in 
Section 2, in Section 3 we illustrate by examples the main ideas of the reduction at 
the same time that we show the main features of nominal unification. In Section 4, 
we prove that freshness equations can be linearly translated into equality equa- 
tions. In Section 5, we show how to translate a nominal unification problem into 
a higher-order patterns unification problem. Then, after proving some properties 
of Higher-Order Pattern Unification in Section 6, we prove that this translation 
is effectively a quadratic time reduction, in Section 7. In Section 8, we establish 
a correspondence between nominal unifiers and pattern unifiers of the translated 
problems. In particular, we prove that the translation function and its inverse are 
monotone w.r.t. the more general relation, and both translate most general unifiers 
into most general unifiers. We conclude in Section 9. 

2. PRELIMINARIES 

In this section we present some basic definitions of Nominal Unification and Higher- 
Order Pattern Unification. We will use two distinct typographic fonts to represent 
nominal terms and A-tcrms along this paper. 
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4 • J. Levy and M. Villaret 

2.1 Nominal Unification 

Nominal terms contain variables and atoms. Only variables may be instantiated, 
and only atoms may be bounded. They roughly correspond to the notions of free 
and bound variables in A-calculus, respectively, but are considered as completely 
different entities. However, atoms are not necessarily bounded, and when they 
occur free, they are not instantiable. 

In nominal signatures we have sorts of atoms (typically v) and sorts of data 
(typically 6) as disjoint sets. Atoms (typically a,b, ...) have one of the sorts of 
atoms. Variables, also called unknowns, (typically X,Y, ...) have a sort of atom 
or sort of data, i.e. of the form v \ S. Nominal function symbols (typically f , g, . . .) 
have an arity of the form ti x ■ • ■ x t„ ^> (5, where (5 is a sort of data and Ti are 
sorts given by the grammar r ::— i^\S \ {v)t. Abstractions have sorts of the form 
{u)r. 

Nominal terms (typically t, u, . . .) are given by the grammar: 

t::=f(ti,...,tp)|a|a.t|7r-X 

where f is a n-ary function symbol, a is an atom, tt is a permutation (finite list 
of swappings) , and X is a variable. They are called respectively application, atom, 
abstraction and suspension. The set of variables of a term t is denoted by Vars(t). 

A swapping (a b) is a pair of atoms of the same sort. The effect of a swapping 
over an atom is defined by (a b) • a = b and (a b) • b = a and (a b) ■ c = c, when 
c 7^ a,b. For the rest of terms the extension is straightforward, in particular, 
(ab)-(c.t) — ((a b)-c).((a b)-t). A permutation is a (possibly empty) sequence of 
swappings. Its effect is defined by (ai bi) . . . (an bp)-t ~ (ai bi)-((a2 ^2) ■ . ■ (a^ bp)-t). 
Notice that every permutation tt naturally defines a bijective function from the set 
of atoms to the sets of atoms, that we will also represent as tt. Suspensions are uses 
of variables with a permutation of atoms waiting to be applied once the variable 
is instantiated. Occurrences of an atom a are said to be bound if they are in the 
scope of an abstraction of a, otherwise are said to be free. 

Substitutions are finite sets of pairs [Xi i~> ti, . . . , Xp i—> tp] where Xj and t; have 
the same sort, and the Xj's are pairwise distinct variables. They can be extended to 
sort-respecting functions between terms, and behave like in first-order logic, hence 
allowing atom capture. For instance [X 1— >■ a]a.X = a. a. Remember that when 
applying a substitution to a suspension, the permutation is immediately applied, 
for instance 

[X^g(a)]f((ab).X,X) =f((ab).g(a),g(a)) -f(g((ab).a),g(a)) =f(g(b),g(a)) 

The domain of a substitution a — [Xi i— > ti, . . . , Xn n> tp] is Dom((T) = {Xi, . . . , Xp}. 
For convenience we consider Dom([X M> X]) = {X} ^ {Y} = Dom([Y 1— > Y]), 
although both substitutions have the same effect when applied to any term.-'^ 
Composition of substitutions is defined by ai o (T2 = [X h^ cri(o'2(X)) | X G 
Dom((Ti) U Dom((T2)]. The restriction of a substitution cr to a set of variables 
V, written cr[v, is defined as cr|v = [X n> cr(X) | X G V]. 

A freshness environment (typically V) is a list of freshness constraints a # X 
stating that the instantiation of X cannot contain free occurrences of a. 



'^We have adopted this definition motivated by Remark 5.9. 

ACM Transactions on Computational Logic, Vol. V, No. N, Month 20YY. 



Nominal Unification from a Higher-Order Perspective • 5 

The notion of a-equivalence between terms, noted «, is defined by means of the 
following theory: 

, , a#X G V for all a such that TT-a 7^ 7r'-a 
(«-atom) — ^ ^ , ^ («-susp.) 



Vhawa' ' Vh7r-X«7r'-X 



Vhti«tl ••• Vhtn«t'„ 

(«-application) 



Vhf(ti,...,tn)«f(tl,...,t'J 



Vht«t' , ,,., a^a' Vht^(aa0.t^ V h a#t' ^ 

- (^-abst-1) --; (f^-abst-2) 



V I- a.t « a.t 
where the freshness predicate 
a 7^ a' 





Vh 


a.t « 


a'.t' 


is 


defined by: 






^ 


(.-!.. 


i#X) 


e V 


) 


Vh 


a#7r 


•X 



7 (#-atom) „, ^ „ TT- (#-susp 



V h a#a' 

V h a#ti ■ ■ ■ V h a#t, 

Vha#f(ti,...,t„) 



(^-application) 



(#-abst-l) ^^J^ I^,!^^ (#-abst-2) 



Vha#a.t ^^ ^ Vha#a'.t 

Their intended meanings are: 

— V h a 7^ t holds if, for every substitution a respecting the freshness environment 
V (i.e. avoiding the atom captures forbidden by V), a is not free in cr(t); 

— V h t « u holds if, for every substitution a respecting the freshness environment 
V, t and u are a-equivalent. 

A nominal unification problem (typically P) is a set of equations of the form t « u, 

or of the form a # ■ t, called equality equations and freshness equations, respectively. 

A solution or unifier of a nominal problem P is a pair (V, cr) satisfying V h 

a # o'(t), for all freshness equations a # t G P, and V h (j(t) « o'(u), for all equality 

? 
equations t w u G P. Later, in Section 5, wc will also require solutions to satisfy 

Dom(cr) = Vars(P). In Remark 5.9 we justify why this does not affect to solvability 

of nominal problems. 

Given two substitutions ai and a2, and two freshness environments Vi and V2, 

we say that V2 h criCVi), if V2 h a # cri(X) holds for each a ^^ X G Vi; and we say 

that Vi h cri « (72, if Vi h cri(X) « cr2(X) holds for aU X G Dom(cri) U Dom(cr2)- 

Given a nominal unification problem P, we say that a solution (Vi,cri) is more 

general than another solution (V2,o'2), if there exists a substitution a' satisfying 

V2 h ^'(Vi) and V2 I- cr'o(ji|Doni(<Ti)uDom(cr2) ~ '''2- As usual, we say that a solution 

a is most general if, for any other solution a' more general than cr, we have also 

that a is also more general than cr'. Most general nominal unifiers are unique, in 
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6 • J. Levy and M. Villaret 

the usual sense: if ai and (T2 are both most general, then ci is more general than 
{72 , and vice versa. 

Example 2.1. The solutions of the equation a.X w b.Y can not instantiate X 
with terms containing free occurrences of the atom b, for instance if we apply the 
substitution [X i— > b] to both sides of the equation we get [X i-t- b](a.X) = a.b, for 
the left hand side, and [X i— s- b](b.Y) — b.Y, for the right hand side, and obviously 
a.b f» b.Y is unsolvable. 

A most general solution of this equation is ({b#X}, Y ^^ (a b)-X]). Another most 
general solution is ({a=i^Y}, [X i-^ (ab)-Y]}. Notice that the first unifier is equal 
to the second composed with ct' = [Y h^ (a b) • X], hence the second one is more 
general than the first one. Similarly, the first one is more general that the second 
one. Hence, both are equivalent. 

2.2 Higher-Order Pattern Unification 

In higher-order signatures we have types constructed from a set of basic types (typ- 
ically djV, . . .) using the grammar r ::— (5 1 i^ | r ^ t, where — ?► is associative to the 
right. Variables (typically X, Y, Z, x, y, z, a,b, . . .) and constants (typically /, c, . . .) 
have an assigned type. 

X-terms are built using the grammar 

t ::~ a; I c I Xx.t \ ti t2 

where x is a variable and c is a constant, and are typed as usual. For convenience, 
terms of the form (. . . (aii) . . . t„), where a is a constant or a variable, will be written 
as a(ti, . . . , t„), and terms of the form Axi. • • • .Ax„.t as Axi, . . . , x„.i. We use x as a 
short-hand for xi, . . . , x„. If nothing is said, terms are assumed to be written in rj- 

long /3-normal form. Therefore, all terms have the form Axi Xxn-h{ti, . . . , tm)^ 

where to, n > 0, /i is either a constant or a variable, ti, . . . ,tm have also this form, 
and the term h{ti, . . . , i^) has a basic type. 

Other standard notions of the simply typed A-calculus, like bound and free oc- 
currences of variables, a-conversion, ^-reduction, rj-long ^-normal form, etc. are 
defined as usual (see [Dowek 2001]). We will notate free occurrences of variables 
with capital letters X,Y, . . ., for the sake of readability. The set of free variables of 
a term t is denoted by Vars(i). When we write an equality between two A-terms, 
we mean that they arc equivalent modulo a, l3 and rj equivalence. When we write 
an equality =„, we mean that they are a-equivalent. 

Substitutions are finite sets of pairs a = [Xi i— >■ ii, . . . ,X„ ^^ t„] where Xi and 
ti have the same type and the X^'s are pairwise distinct variables. They can be 
extended to type preserving function from terms to terms as usual. The domain 
is Dom((T) = {Xi, . . . , Xn}- We say that a substitution ai is more general than 
another substitution CT2, if there exists a substitution cr' satisfying cr' o (Ji{X) = 
a2{X), for all X e Dom(CTi) U Dom(CT2). We say that a variable X occurs in a 
substitution a, li X G Vars(CT(y)), for some Y e Dom((T). 

A higher-order unification problem is a finite set of equations P = {ti = 
Ui,...,tn = Un}, where ti and Ui have the same type. A solution or unifier of 
a unification problem P is a substitution a satisfying a{t) — (j{u), for all equations 
t = u E P. We say that a unifier a is most general if, for any other unifier cr' more 
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general than cr, we have a is also more general than a' . 

A higher-order ■pattern is a A-term where, when written in /?ry-nornial form, all 
free variable occurrences are applied to lists of pairwise distinct bound variables. 
For instance, \x.j{X{x\Y\ f{c,Xx.x) and Xx,y.X{Xz.x{z),y) are patterns, while 
\x.f{X{X{x)),Y), f{X{c), c) and Xx.\y.X{x, x) are not. Notice that, since Xz.x{z) 
is equivalent to x, the parameters of X(Xz.x{z), y) are considered a list of pairwise 
distinct bound variables. 

Higher- order pattern unification is the problem of deciding if there exists a unifier 
for a set of equations between higher-order patterns. Like in nominal unification, 
most general pattern unifiers are unique. Moreover, most general unifiers instantiate 
variables by higher-order patterns. 

The following is a set of rules defining Nipkow's algorithm [Nipkow 1993] that 
computes, when it exists, the most general unifier of a pattern unification problem. 





Xx . s = 


= Xx.t ^ {{s = t},[]) 


a{ti, 


...,tn)^ a(ui,. 


■■,Un) -^ ({ti =Ui,...,i„ =«„},[]) 

where o is a constant or bound variable 




Y(x) = a{ui, . . 


•,■"„) -> {{Yi{x) ^Ui,...,Ymix) =Um}, 



[Y^Xx.a{Yi{x),...,Ym{x))]) 
where Y ^ FV(ui, . . . , Um) 
and a is a constant or a G {x} 

X{x)=^X{y) -^ (0, [Xh- Af.Z(z)]) 

where {z} = {xi \ x^ = yj 

X{x) ^ Y{y) -^ (0, [X ^ Xx.Z{z), Y ^ Xy . Z{z)]) 
where X ^Y and {z] = {x} f) {y} 

The rules transform any equation into a pair (set of equations, substitution). 
The algorithm proceeds by replacing the equation on the left of the rule by the set 
of equations on the right. The substitution is applied to the new set of equations, 
and used to, step by step, construct the unifier. Therefore, any rule of the form 
i = w — > {E, p) produces a transformation of the form 

{P\j{t = u}, a) =^ {p{P) LiE,poa) 

The algorithm starts with the pair {P, Id) and, if P is solvable, finishes with (0, cr), 
where a with domain restricted to FV(P) is the most general unifier [Nipkow 1993, 
Theorem 3.1]. 

In the first rule the binder can be removed because, in Nipkow's presentation, 
free and bound variable names are assume to be from distinct sets, and can be 
distinguished. The equations on the right of the second rule may not be normalized, 
i.e. the term Xx.Yi{xi, . . . , x„) may require a yy-expansion when Ui is not base typed. 

There is an algorithm that finds higher-order pattern unifiers, if exist, in linear 
time [Qian 1996]. 

3. FOUR EXAMPLES 

In order to describe the reduction of nominal unification to higher-order pattern 
unification, we will use the unification problems proposed in [Urban et al. 2003; 
2004] as a quiz. 
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8 • J. Levy and M. Villaret 

Example 3.1. The nominal equation 

a.b.f(Xi,b) «b.a.f(a,Xi) 

has no nominal unifiers. Notice that, although unification is performed modulo 
a-equivalcnce, as far as we allow atom capture, we can not a-convert terms before 
instantiating them. Therefore, this problem is not equivalent to 

a.b.f(Xi,b) wa.b.f(b,Xi) 

which is solvable, and must be a-converted as 

a.b.f(Xi,b) «a.b.f(b, (ab)-Xi) 

Recall that (a b)-Xi means that, after instantiating Xi with a term that possibly 
contain a or b, we have to exchange these variables. 

According to the ideas exposed in the introduction, we have to replace every 
occurrence of Xi by Xi{a, b), since (a, b) is the list of atoms (bound variables a, b) 
that can be captured. We get: 

Xa.\b.f{Xi{a,b),b) ^ \b.Xa.f{a,Xi{a,b)) 

Since this is a higher-order unification problem, we can a-convcrt one of the sides 
of the equation and get: 

Xa.Xb.f{Xi{a,b),b) == Xa.Xb.f{b,Xi{b,a)) 

which is unsolvable, like the original nominal equation. 

Example 3.2. The nominal equation 

a.b.f(X2,b) «b.a.f(a,X3) 

is solvable. Its translation is 

Xa.Xb.f{X2ia,b),b) ^ Xb.Xa.f{a,X3{a,b)) 

The most general unifier of this higher-order pattern unification problem is 

X2 ^ Xx.Xy.y 
X^ ^^ Xx.Xy.x 

Now, taking into account that the first argument corresponds to the atom a, and 
the second one to b, we can reconstruct the most general nominal unifier as: 

X2H^ b 
X3 ^^ a 

Example 3.3. In some cases, there are interrelationships between the instances 
of variables that make reconstruction of unifiers more difficult. This is shown with 
the following example: 

a.b.f(b,X4) «b.a.f(a,X5) 

that is solvable. Its translation results on: 

Xa.Xb.f{b,Xiia,b)) = Xb.Xa.f{a,X5{a,b)) 
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Nominal Unification from a Higher-Order Perspective • 9 

and its most general unifier is:^ 

X^ H- > Xx.Xy.X^{y,x) 

This higher-order unifier can be used to reconstruct the nominal unifier 

X4H^ (a b)-X5 

The swapping (a b) comes from the fact that the arguments of X5 and the lambda 
abstractions in front have a different order. 

Example 3.4. The solution of a nominal unification problem is not just a substi- 
tution, but a pair (V, a) where cr is a substitution and V is a freshness environment 
imposing some restrictions on the atoms that can occur free in the fresh variables 
introduced by a. The nominal equation 

a.b.f(b,X6) wa.a.f(a,X7) 
has as solution 

a=[X6^(ba)-X7] 

V = {b#X7} 

where the freshness environment is not empty and requires instances of X7 to not 
contain (free) occurrences of b. Let us see how this is reflected when we translate the 
problem into a higher-order unification problem. The translation of the equation 
using the translation algorithm results on: 

\a.\b.f{b, Xe{a, b)) ^ Xa.\a.f{a, ^7(0, &)) (1) 

After a convenient a-conversion we get 

Aa.Ac./(c, Xe(a, c)) = Aa.Ac./(c, ^7(0, 6)) 

The most general unifier is again unique: 

Xe ^ Xx.Xy.Xs{y,b) 
Xj n> Xx.Xy.Xs{x,y) 

Nevertheless, in this case we cannot reconstruct the nominal unifier. Moreover, 
by instantiating the free variable 6, we get other (non-most general) higher-order 
unifier without nominal counterpart. The translation does not work in this case be- 
cause b occurs free in the right hand side of (1). We translate both atoms and nom- 
inal variables as higher-order variables. Occurrences of nominal variables become 
free occurrences of variables, and occurrences of atoms, if are bounded, become 
bound occurrences of variables. Therefore, in most cases, after the translation the 
distinction atom/variable become a distinction free/bound variable. However, if 
atoms are not bounded, as in this case, they are translated as free variables, hence 
are instantiable, whereas atoms are not instantiable. 

To avoid this problem, we have to ensure that any occurrence of an atom is 
translated as a bound variable occurrence. This is easily achievable if we add 



•^Thc unifier X5 h^ \x.Xy.X4,{y, x) is equivalent modulo variable renaming. In this case we obtain 
the also equivalent nominal unifier X5 h->. (a b)X4. 
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10 • J. Levy and M. Villaret 

binders in front of both sides of the equation. Therefore, the correct translation of 
this problem is: 

Aa.A6.Aa.A6./(fe, Xg(a, 6)) = Xa.Xb.Xa.Xa.f{a, X'j{a,b)) 

where two new binder Aa.A6 have been introduced in front of both sides of the 
equation. The most general unifier is now: 

Xe ^ Xx.Xy.Xsiy) 
X-! n> Xx.Xy.X^{x) 

This can be used to reconstruct the nominal substitution: 

X6^(a b)-X8 

Xy^Xs 

As far as X'g,{x) is translated back as Xs, and X^{x) does not uses the second 
argument (the one corresponding to b), we have to add a supplementary condition 
ensuring that Xg does not contain free occurrences of b. This results on the freshness 
environment {b^^Xs}. Then, X^(y) is translated back as (a b)-X8. 

4. REMOVING FRESHNESS EQUATIONS 

In this section we show that freshness equations do not contribute to make nominal 
unification more expressive. We prove that nominal unification can be linearly- 
reduced to nominal unification without freshness equations. We call this restriction 
of nominal unification equational nominal unification. In next sections we will 
describe a quadratic reduction of equational nominal unification to higher-order 
pattern unification. The absence of freshness equations makes the reduction to 
higher-order pattern unification simpler, compared with the reduction described in 
the preliminary version of this paper [Levy and Villaret 2008] . 

Definition 4.1. We define the translation of nominal unification problems into 
equational nominal unification problems inductively as follows: 

Eq{{a # ■ t} U P) = {a.b.t « b.b.t} U Eq{P) for some b ^ a 
Eq{{t « u} U P) = {t w u} U Eq{P) 

Lemma 4.2. Given a nominal unification problem P, its translation into equa- 
tional nominal unification Eq{P) can be calculated in linear time. Hence, Eq{P) 
has linear- size on the size of P. 

Lemma 4.3. The pair (V, cr) solves P, if, and only if, (V,cr) solves Eq{P). 

Proof. We first prove that (a#t. Id) is a solution of {a.b.t « b.b.t} when b 7^ a 

a#t 

: : (lemma 2.7) 

twt b#(ab)-t a#t 

^^^ («-abst-2) -^ (#-abst-2) 

b.t « a. a b -t a#b.t 

^^-j- ^1- ^— (~-abst-2) 

a.b.t « b.b.t 
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In this proof we prove t « t from an empty set of assumptions. We can prove 
that this is always possible, for any term t, by structural induction on t. We also 
prove b:^(a b)-t from a^^t, using Lemma 2.7 of [Urban et al. 2004]. 

Lemma 2.14 of [Urban et al. 2004] states that V h (7{V) and V h t « t' implies 
V h o-(t) « o-(t'). In particular, V h (T(a#t) and a#t h a.b.t ss b.b.t implies V h 
(T(a.b.t) « (j(b.b.t). Therefore, if (V, a) solves a#'t, then (V, a) solves a.b.t ss b.b.t. 

Second, analyzing the previous proof, we see that the inference rules applied in 
each situation were the only applicable rules. Therefore, any solution (V, a) solving 
a.b.t fa b.b.t, also solves a^^t, because any proof of cr(a.b.t) w cr(b.b.t) contains a 
proof of a#(T(t) as a sub-proof. 

From, these two facts we conclude that a^ 't and a.b.t « b.b.t have the same 
set of solutions, for any b 7^ a. Therefore, {a ^ 't} U P and {a.b.t « b.b.t} U P, also 
have the same set of solutions, for any nominal unification problem P. From this 
we conclude that P and Eq{P) have the same set of solutions. D 

Corollary 4.4. Nominal unification can be linearly-reduced to equational nom- 
inal unification. 

5. THE TRANSLATION ALGORITHM 

In this section we formalize the translation algorithm. We transform equational 
nominal unification problems into higher-order unification problems. Both kinds of 
problems are expressed using distinct kinds of signatures. In nominal unification 
we have sorts of atoms and sorts of data. In higher-order this distinction is no 
longer necessary, and we will have a base type for every sort of atoms v or sort of 
data S. We give a sort to types translation function that allows us to translate any 
sort into a type. 

Definition 5.1. The translation function is defined on sorts inductively as fol- 
lows. 

In X • • . X Tn -> rl = Inl -> • • • ^ iTnl ^ H 

where 6 and v are base types. 

Remark 5.2. The translation function for terms depends on all the atoms oc- 
curring in the nominal unification problem. We assume that there exists a fixed, 
finite and ordered list of distinct atoms (ai, . . . ,2„) used in the problem. This 
seems to contradict the assumption of a countably infinite set of atoms for every 
sort. However, this does not imply a loss of generality as far as every nominal 
unification problem only contains a finite set of atoms, and its solutions can be 
expressed without adding new atoms (this is a consequence of Lemma 6.11). Notice 
also that the nominal unification algorithm in [Urban et al. 2004] generates unifiers 
that do not introduce new atoms, because, in all transformation rules, the set of 
atoms in the right-hand side are a subset of the set of atoms in the left-hand side. 

From now on, we will consider this list given and fixed. 

In [Dowek et al. 2009; 2010] they solve this problem using a permission set for 
every variable. They roughly correspond to the set of atoms capturable by this 
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variable. However, in their case, this set is infinite and co-infinite. In our case, we 
prove that solutions can be expressed using the same finite set of atoms occurring 
in the problem, and the set of capturable atoms of a variable is finite and co-finite. 

For every function symbol f, we will use a constant with the same name /. 
Every atom a is translated as a (bound) variable, with the same name a. For every 
variable (unknown) X, we will use a (free) variable with the same name X . Trivially, 
atom abstractions a.t are translated as lambda abstractions Xa.t, and applications 
f (ti, . . . , tn) as applications /(ii, . . . , t„). The translation of suspensions tt-X is more 
complicated, as far as it gets rid of atom capture. Recall that in all cases we use 
distinct character fonts for symbols of nominal logic and symbols of the higher-order 
framework. The translation is parametric on a freshness environment. Notice that, 
although we have removed freshness equations, nominal unifiers are composed by 
a freshness environment and a substitution. 

Definition 5.3. Let (ai, . . . ,an) be a fixed ordered list of atoms. The transla- 
tion function from nominal terms with a freshness environments V into X-terms is 
defined inductively as follows. 

Wv = « 

If(ti,...,t,)l^ = /(Itil^,...,M^) 

la.tl^ = \a. W^ 

I^-Xl^ = Xd^-bil^ , . . . , I^-b^l^) where (hi, . . . , b^) = (a £ (aj, . . . , a,) | a # X ^ V) 

where, for any atom a : v, a : Ji^] is the corresponding bound variable, for any 
function symbol f : t, f : |t] is the corresponding constant, and for any variable 
X : T, the list (bi,...,bm) is the sublist^ of (ai,...,ap) composed by the atoms 
satisfying af/=X ^ V, and X : Ji^i] —>...—> I^ml -^ h'l ^^ ^^^ corresponding free 
variable, where bj : Vj.^ 

Lemma 5.4. For every nominal term t of sort t, and freshness environment V, 
|i]y is a X-term with type Jr]. 

Proof. The proof is simple by structural induction on t. The only point that 
needs a more detailed explanation is the case of suspensions. Since a\ : V], X : t, 

and X : Kl ^ > ^J ^ H, we have [X]^ = X ([aj^ , . . . , Ja.J^) : 

It}. When X is affected by a swapping (ai^ aij we also have [(aijaiJ-X| = 

^(•••^hJv.'Wv JvIv'-:-'kJv'hIv'kJv'---) ■■ W because 
the suspension is not a valid nominal term unless a-,, and ai^ belong to the same 

sort. The same applies to arbitrary permutations. D 

Example 5.5. Given the nominal term t ~ a.b.c.(ca)(a b)-X, after applying the 
substitution cr = [X h^ f(a,b, c,Y)] we get a-{t) =- a.b.c.f(b,c, a, Y). Let (a,b,c) be 
the (ordered) list of atoms of our problem. The translation of the term tw.r.t. Vi = 
results into Jt]y = Aa.A&.Ac.X(6, c, a) and, the translation of the instantiation 
a{\) w.r.t. V2 = {a=i^Y} results into |(T(t)]y = Aa.A6.Ac./(6, c, a, F(c, a)). There is 
a A-substitution \X ^^ Aa.A6.Ac./(a, 6, c, F(6, c))] (described in Definition 5.8) that 



^Notice that we say sublist, not subset, to emphasize that the relative order between a's is pre- 
served. 
''Notice that bj and 7r-bj are of the same sort. 
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when applied to |t]y results into |cr(t)]y . Graphically this can be represented as 
the commutation of the following diagram (proved in Lemma 5.11). 



a.b.c.(ca)(a b)-X 



[XH^f(a,b,c,Y)] 



a.b.c.f(b,c, a, (ca)(a b)-Y) 



Ha#n 



X X?, X ^(h ^ [X^\a.\h.\c.f{aAc,Y{b,c))] 

Aa.Ab.AC.X[b,c,a) ► Aa.Ao.Ac./(o, c, a, j (c, ajj 

Definition 5.6. Let (ai,...,ap) be an ordered list of atoms. The translation 
function is defined on equational nominal problems inductively as follows 

{t « u} U P] = {Aai Aa„.Itl(, = Xai Aa„.lu]0} U [PI 

Lemma 5.7. Given an equational nominal unification problem P, its translation 
|P] is a higher-order pattern unification problem. 

Moreover, the size and the time needed to compute [P] is bounded by the square of 
the size of P. 

Proof. By Lemma 5.4, Aai Aa„.|t]0 = Aai Aa„.|u]0 is an equa- 
tion between A-terms of the same type. Now notice that [tt-XJ^ = 
X (|7r-bi]y , . . . , |7r-bm]v) translate the variable X into an application of the free 
variable X to a list of pairwise distinct bound variables, because the b; are all dif- 
ferent, TT is a permutation, and wc ensure that all atoms are translated into bound 
variables by adding A-bindings in front of both terms. Therefore, both sides of the 
equation are higher-order patterns. 

Concerning the size of the translation, we obtain a quadratic bound due to the 
translation of these suspensions. D 

Finally, we have to translate solutions of nominal unification problems into A- 
substitutions. 

Definition 5.8. Let (ai, . . . , ap) be a fixed ordered list of atoms. Given a nom- 
inal substitution a, and a freshness environment V, we define the following trans- 
lation function 

Wv= U [^^Aai.--.Aa„.HX)lv' 

XeDom(<T) 

The following remark shows why in some places we require that solutions (V, a) 
of a nominal problem P satisfy Dom(cr) — Vars(P). 

Remark 5.9. Let (a, b) be the fixed list of atoms. 

Consider the nominal unification problem Pi = {a.X w b.Y}, and its translations 
as a higher-order pattern unification problem 



IPil 



{a.X « b.Y} = {\a.\b.\a.X{a, b) = Xa.Xb.Xb.Y{a, b)} 
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The \- substitution 

a, = I[X ^ (a b).Y]l^^^Y} = [X^ Xa.Xb.Yia)] 

does not solve JPi]. Whereas the X- substitution 

CT2 = I[X^ (ab)-Y,YH^ Y]]{^_^Y} = [X ^ Xa.Xb.Y{a), Y ^ Xa.Xb.Y{b)] 

solves |Pi] . Notice that in the first case the domain of the nominal unifier (as 
defined in Section 2) is {X}, whereas in the other case it is {X, Y} = Vars(Pi). 

We will see (Theorem 5.13) that, i/Vars(P) C Doiii(cr) and (V,(t) solves P, then 
Icrjy solves [P]. With this example we see that the first condition in the implication 
is necessary. 

Now, consider the nominal unification problem P2 = {a.b.(ab)X « b.b.(a b)X}, 
and its translation as 

[P2I = {Xa.Xb.Xa.Xb.X{b,a) = Xa.Xb.Xb.Xb.X{b,a)] 

In this case, the pattern substitution cti is a most general pattern unifier of IP2]; 
and a-i is a pattern unifier, but not a most general one. 

Therefore, we have to require Vars(P) D T)om.{a), if we want to ensure that the 
translation not only preserves unifiability, but also most generality. 

Notice that w.l.o.g. we can require most general nominal solutions to satisfy 
Vars(P) = Dom((T), because most general solutions do not instantiate variables not 
belonging to Vars(P), and we can always add pairs X h-> X for all variables occurring 
in P and not in Dom(cr). 

Notice also that in (72 there are two free variables with the same name Y , but 
distinct types. Be aware that inY 1-^ Xa.Xb.Y{b) the replaced Y has two arguments, 
whereas the introduced Y has only one argument (they have distinct types). In X- 
calculus this is not a problem. The reason of this duplicity is that the translation 
function is parametric on a freshness environment V. This is relevant in the case 
of a nominal variable. For instance, [Y]™ — Y{a,b) where we use the replaced Y 
with two parameters, and |Y] r /^^yi — Y{b) where we use the introduced Y with one 
parameter. If we would like to avoid this duplicity we have to forbid the use of a 
variable of the problem in the right-hand side of a nominal solution. Then, in our 
example Pi, the most general nominal solution could be written as ({a#Y'}, [X H> 
(ab)Y',YH^ Y']). 

To prove that the translation of the solution of a problem is a solution of the 
translation of the problem, we start by proving the following two technical lemmas. 

Lemma 5.10. For any freshness environment V, nominal terms t, u, and atom 
a, we have 

(1) V h a #t i/, and only if a ^ FV(|t]^), and 

(2) V h t w u if and only if [[t]^ =„ [[u]^. 

Proof. The first statement can be proved by routine induction on t and its 
translation. Notice that atoms are translated nominally into variables and that 
the binding structure is also identically translated, hence, the freshness of an atom 
a corresponds to the free occurrence of its variable counterpart a. We here only 
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comment the case t = tt-X, in this case, |7r-X]y — X (|7r-bi]y , . . . , |7r-bm]y), 
where bi#X ^ V, for any i G {l..m}. Therefore, we can estabhsh the foUowing 
sequence of equivalences V h a^^Tr-X iff tt"-' -a^X € V iff tt"-*- -a ^ {bi, . . . , bm} iff 
a ^ {TT-bi, . . . , TT-b^} iff a ^ FV(X(|^-bilv , • ■ • , I^-bmlv)) iff a ^ FV([7r-X]). 

The proof of the second statement can be done by induction on the equivalence 
t w u. We only comment the equivalence between suspensions: tt-X w tt'-X. Notice 
that, TT-X « tt'-X if, and only if, for all atoms a such that 7r-a 7^ 7r'-a, we have 
a#X € V. This condition is equivalent to: the bound variables |7r-a]y and |7r'-a]y 
are passed as a parameter to \ in |7r-X]y and [tt'-XJ^ only when 7r-a = 7r'-a. 
Finally, this condition is equivalent to [tt-X] ^^ = [tt'-X] y. D 

The first statement of the previous lemma will not be necessary for our purposes 
because we have removed freshness equations. 

Lemma 5.11. For any freshness environment V, nominal substitution a, and 
nominal term t satisfying Vars(t) C Dom(cr), we have Iclydtjn) — |cr(t)]y. 

Proof. Again this lemma can be proved by structural induction on t. We only 
sketch the suspension case. Let t = tt-X. We have the equalities: 

Hv(I^-Xl0) = [- . - ,X ^ Aai . . . Aa„ . Ia(X)lv , - - -] (X(b-ailv , • - - , I^r-anlv)) 

= (Aai...Aa„.Icr(X)]^) (l7r-ai]^ , . . . , |7r-an]^) 

= [ai h^l7r-ai]^,...,a„H^ iTT-an]^] ([tT(X)]^) 

= I'r-tT(X)l^ 

= M^-^v 

Notice that in the first equality we use X G Vars(t) C Dom(CT), hence X e 
Dom(Hv)- n 

Example 5.12. Let be t = f((a b)-X, (a b)-Y), V = {b#Y} and a =[X^ b.a, 
Y H^ Y] . We will have 

Hv = I[X ^ b.a, Y ^ Y]1{,^Y} = [^ ^ Aa.A6.Ib.al{,^Y}, Y ^ Aa.Afe.IYl^.^Y}] 
= [X ^ \a.\b.\b.a, Y ^ \a.\b.Y{a)] 

1% = |f((ab)-X,(ab)-Y)lj, = /(X(6,a),y(fe,a)) 

Ia(t)l^= |[X ^ b.a, Y .^ Y] f ((a b) -X, (a b) - Y)] ^^^^i = l^i^-^^ (^ b) ' Y)l{b#Y} 
= f{\a.b,Y{b)) 

Now, we have 

H V ( W0) = f{ {\a.\b.\b.a)(h,a) , {\a.\b.Y{a)){b, a)) - /(Ac.6, Y{b)) = f{Xa.b, Y{b)) 

= k(t)lv 

Notice that the substitution resulting form the /3-reduction of the underlined 
redex needs to avoid a capture of 6. This is done replacing the bound variable b 
by c. In the following section we will see that, in pattern unification, we can do 
this without using new bound variable names. In this case, we could have used a 
instead of c. 

From these two lemmas we can prove the following results. 
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Theorem 5.13. For any freshness environment V , equational nominal unifica- 
tion problem P , and nominal substitution a with Vars(P) C Doin(cr) , we have that 
(V,(7) solves the equational nominal unification problem P, if, and only if ^crj^ 
solves the pattern unification problem |P]. 

Proof. By definition of nominal solution, the pair (V, a) solves P iff 

V h cr(t) « cr(u) for all t w u e P 

By Lemma 5.10 this is equivalent to: 

Ia(t)lv =„ Ia(u)l^ for aU t « u e P 

and, by Lemma 5.11 this is equivalent to: 

Hv(W0) = Hv(H0) forallt«uGP 

Since the substitution |cr]y does not instantiate the variables ai,...,a„, this is 
equivalent to (see Remark 5.14): 



Aai Aan.JtL) = [[o-]^(Aai Aa„.[[u 



for all t « u e P 



where (ai, . . . , an) is the list of atoms occurring in P. 



Finally, since 
to IcrJv solves IP'], n' 



Afli Aa„.Jt]0 = Afli Afln.JuJg, this is equivalent 



The proof of Theorem 5.13 also allows us to prove that (V,cr) solves t w u, if, 



and only if, |cr].^ solves |t] 



|u]n. Therefore, it seems unnecessary to add the 



A-bindings Aai. • • • .Aa„ in front of both sides of the higher-order equations, as was 
suggested in Example 3.4. The following remark illustrates what would happen if 
we had defined translation of equations in this way. 



Remark 5.14. Assume that we had defined 



W0 - [[ul0, instead of 



the definition we have for 



with the external lambda's. 



The translation of the unsolvable nominal equation a « b would result into a = b 
which is solvable by [a 1— >■ b] (notice that, in this case, atoms are translated into free 
variables). The example does not contradict Theorem 5.13 because the substitution 
[a I— !> b] is not the translation of any nominal substitution, i.e. there does not exists a 
freshness environment V and a nominal substitution n such that |(T]y = [a H> 6]. // 
we introduce the external X-bindings we get the unsolvable higher- order unification 
problem Xa.Xb.a ~ Xa.Xb.b. 

On the other hand, the translation of the solvable nominal equation of Exam- 
ple 3.4 would be 

PI = |{a.b.f(b,X6) w a.a.f(a,X7)}| = {Xa.Xb.f{b,Xe{a,b)) = Xa.Xa.f{a,X7(a,b))} 

that is not a higher-order pattern unification problem (notice that Lemma 5.7 does 
not hold if we do not introduce the external X-bindings). 
The translation of its nominal most general solution is 

H^ = I[X6 ^ (b a).X7]l{b#x,} = [^6 ^ Xa.Xb.Xj{b),Xj ^ Xa.Xb.Xr{a)] 
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In this case, Jcr]y is a higher-order unifier of |P], as Theorem 5.13 predicts. How- 
ever, it is not a most general unifier, and we are interested in translating most 
general solutions into most general solutions. 

Theorem 5.15. If the equational nominal unification problem P is solvable, then 
the higher-order pattern unification problem JP] is also solvable. 

Proof. The theorem is a direct consequence of Theorem 5.13. D 

The opposite imphcation of Theorem 5.15 can not be directly proved from Theo- 
rem 5.13, because |P] should have solutions that are not of the form Jcrjy , for any 
solution (V,ct) of p. 

5. SOME PROPERTIES OF PATTERN UNIFICATION 

In this section we prove some fundamental properties of Higher-Order Pattern Uni- 
fication. In particular, we prove that we can express most general unifiers of pattern 
unification problems only using bound-variable names and types already used in 
the problem. This property is used in next sections in the translation of pattern 
unifiers into nominal unifiers. 

In the following example we note that in the solution of pattern unification prob- 
lems it is important to save names of bound variables. In the following we will 
distinguish between variables and variable names. For instance Xx.Xx.x has three 
occurrences of variables, two distinct variables, with one unique variable name. 
Notice that a-conversion preserves the number of variables, but may change the 
number of names. 

Example 6.1. Consider the nominal problem a.X w a.f(b.Y). Its transla- 
tion is \a.Xb.\a.X{a,b) = Xa.Xb.Xa.f{Xb.Y{a,b)). An a-conversion results in 
Xa.Xb.Xc.X{c,b) = Xa.Xb.Xc.f{Xd.Y{c,d)) and it shows that the parameters oi X 
and Y are in fact different. A most general solution is [X i-^ Xc.Xb.f{Xd.Y{c,d))]. 
Since Y is translated as Y{a, b), we would have to translate back Y{c, d) as (a c)(d b)- 
Y. And, since substitutions like [X n> t] are translated as [X i-> Xa.Xb. [[t]y], we 
would have to translate back [X i—> Xc.Xb. [[t]^] as [X t-^ (ac)-t]. Therefore, our 
pattern unifier had to be translated back as [X h^ (a c)-f(d.(a c)(d b)-Y)]. However, 
the list of atoms is fixed as the list of atoms occurring in the problem, hence, we 
know how to translate a and b as a and b and vice versa, but we do not know how 
to translate back c and d. Here it is done introducing new atoms. However, the 
use of an infinite list of atom names would imply that the list of arguments of a 
variable (the list of capturable atoms) would be infinite. 

If we look at Nipkow's transformation rules described in Subsection 2.2, it seems 
that no new bound- variable names are introduced. However, this is not true. There 
are three places where their introduction is hidden. In the following we illustrate 
these cases. 

(1) It is assumed that equations have the same most external A-bindings, i.e. 
that they are of the form Xx.s = Xx.t. If this is not the case, we have 
to a-convert one of the sides. However, this is not always possible without 
introducing new bound-variable names. For instance, if we have the equa- 
tion Xx.Xy.Xy.X{x,y) = Xy .Xy .Xx .Y [x , y) , after a-converting the two most 
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external A-binder, we get \x.\y.\y.X{x,y) = \x.\y.\x.Y{x,y), that needs 
a new bound-variable name to obtain the same A-binders in both sides, by 
means of a-conversion. Using a new name z we would get Xx.\y.\z.X{x,z) = 
Xx.Xy.Xz.Y{z,y). 

(2) In the flex-rigid rule the terms Ui may not be of first-order type. In this 
case, we need to 77-cxpand some subterms. For instance, the rule transforms 
Xx.X{x) = Xx.f{Xx.g{x)) into the equation Xx.Xi{x) = Xx.Xx.g{x) and the 
substitution [X h- > Xx.f{Xi{x))\ . The left-hand side of the equation needs to 
be 77-expanded, and wc can not use the name x. Using a new name z, and 
a-converting we would get Xx.Xz.Xi{x, z) = Xx.Xz.g{z). 

(3) When we compute a substitution for a variable, it must be applied 
to all the occurrences of the variable, and this may involve a /?- 
reduction. Some /3-reductions need to introduce new names to avoid 
variable-captures. For instance, if we have the equations {Xx.Xy.X{x,y) = 
Xx.Xy.f{Xx.Y{x,y)), Xx.Xy.Z{x,y) = Xx.Xy.X{y,x)j , after solving the first 
one we get \X n> Xx .Xy . f {Xx .Y {x , j/))l that must be substituted in the second 
equation. We get, Xx.Xy.Z{x,y) = Xx.Xy.(^Xx.Xy.f{Xx.Y{x,y))){y,x). The /?- 
reduction using the standard substitution algorithm introduces a new name z to 
avoid the capture of the variable x, giving Xx.Xy.Z{x, y) = Xx .Xy . f {Xz .Y {z , x)) 

In the following we show how we can overcome these problems. One of the ideas 
is using a kind of swapping for A-calculus, instead of the usual substitution, like it 
is done in nominal terms. 

Definition 6.2. Given two variables x,y, and a X-term t, we define the swap- 
ping of X and y in t, noted by {xy)-t inductively as follows 



[xy)-x = y 




{xy)-y = X 
{xy)-z = z 
{xy)-c = c 
{xy)-(Xz.t)^ 
{xy)-(a{ti,.. 


if z ^x,y 

X{{xy)-z).{{xy)-t) 
.,i„)) = ((xy)-a)((xy)-ti 


ere c is a constant and a 


is a constant or a variable 



,{xy)-tr. 



Notice that this swapping is distinct from the swapping on nominal terms. In 
particular {ab)X = X, and we do not keep suspensions. In some cases its ap- 
plication results into an a-equivalent term, but in general the result is a different 
term. 

Remark 6.3. In X-calculus, following the Barendregt variable convention, oper- 
ations are defined on classes of a-equivalent terms, rather than on particular terms. 
This, for instance, allows us to freely a-convert terms in substitutions in order to 
avoid variable capture. Therefore, (although it is often omitted) we have to prove 
that the operation is independent of the representative of the class that we take. The 
previous swapping operation is defined for particular terms. However, the following 
lemma ensures that it can be extended to a-equivalent classes of terms. Barendregt 
variable convention suggests to use distinct variable names for distinct variables. 
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Here, since we try to avoid the introduction of new variable names, we do not use 
the convention, and work with particular terms. 

Lemma 6.4. For any term t and variables x and y, we have 

(xy)-t —a [x ^^ y,y ^^ x]t 

where [x ^-^ y,y ^^ x\ changes x by y and y by x in t, simultaneously. 
In particular, if x,y ^ FY(t), then {xy)-t —a t. 

Proof. By structural induction on t. For one of the cases of A-abstraction, for 
instance, we have 

{xy)Xx.t —\y.{xy)t By induct, hypothesis 

— \y.[x ^^ y,y i-^ x]t Let be z ^ FV(t) U {x, y} 

= Xy.[z (— > j/][j/ 1—> x][x n> z]t Since y ^ FY{[y h-> a;][x t-^ z]t) 

=Q, Xz.[y I— > x][x I— > z]t Since z ^ x,y 

= [y M. x\\z.[x !-!> z\t Since z ^ FV(i) 

=a [y ^^ x\\x.t Since x ^ FV(Ax.t) 

= [x ^^ y,y ^^ x\\x.t 

D 

Lemma 6.5. If y is a list of pairwise distinct variable names^ , \y\ = \x\ ~ n 
and {y} n FV(Af.i) = 0, then 

(\x.t){y) = n„(f,y)-t 

where n„ (x, y) is a permutation on the names x, y defined inductively as 

n,{{x),{y)) = {xy) 

n„((a;i, . . . , x„), (yi, . . . , y„)) == n„_i(((a;i yi)-X2,..., {xi yi)-Xn), (2/2, • • • , yn))-{xi yi) 

Proof. By induction on the length n of both vectors. Obviously, the variable 
xi is not free in Axi.Ax2, . . . ,a;„.i. By assumption, the variable yi is neither free 
in this term. 

From FV(Aa:;2, . . . ,a;„.t) C FV(Aa;.i) U {xi}, and xi,yi ^ FY{Xx.t), we have 
FV((a;iyi)-(Aa;2,...,x„.t)) C FY{Xx.t) U {yi}. Since yi ^ {y2,...,y„} and {y} n 
FY{Xx.t) = 0, we have {y2, . . . ,y„} n FV((a;i yi)-(Aa;2, . . . ,a;„.i)) = 0. Therefore, 
we can apply the induction hypothesis to the term (xi yi) • (Aa;2, . . . , x„.i) and the 
vector (y2, . . . , y„), obtaining 

iXx.t){y) =a {Xyi.{xiyi)-{Xx2, . . . ,Xn.t)){yi,y2, . . . ,yn) By Lemma 6.4 

=/3 {{xi yi) ■ (Ax2, . . . , x„.i))(y2, ...,yn) By /3-reduction 

= (A(xiyi)-X2,...,(xiyi)-x„.(xiyi)-t)(y2,...,y„) By def. of swapping 

= n„_i(((xiyi)-X2,...,(xiyi)-x„), (y2, . . . , y„}) -(xi yi)-i By induct, hypothesis 
= n„(x,y)-t 

D 



^Notice that we do not require x to be pairwise distinct. If they are also pairwise distinct, then 
n„(x, y) = {x„ y„) ... (xi yi). 
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Now we will describe a variant of the higher-order pattern unification algorithm 
of Section 2.2. In this variant, external A-binders are a-converted explicitly and 
the hex-rigid rule has been replaced by a new rule where Ty-expansion is made 
explicit, i.e. the terms m are base-typed, thus the right-hand side does not need 
to be 77-expanded, like in the original rule. Moreover, /3-redexes are removed using 
swappings, according to Lemma 6.5, since we are dealing with patterns. 

Definition 6.6. We assume unoriented equations and define the following set 
of transformation rules over higher-order pattern equations: 

a-transformation: 

Xw.Xx.t == Xw.Xy.u -^ (^Xw.Xx.t = Xw.{x y)-{Xy.u), [ ]) 
if X ^ FV(u) 

Xw.Xx.t = Xw.Xx.u — > (Xw.t i Xw.u, [ ]') 

if X ^ FV(t) and x ^ FV(u) 

Xw.Xx.t i: Xw.Xx.u — > (^Xw.Xx.t ^ Xw.Xx.u, [X ^-> Xy.Z{z)]^ 
if X ^ FV(t), X{y) is a subterm of u, 
X e {y} and {z} = {y} \ {x} 

Rigid-rigid: 

Xw.a{ti, . . . , t„) = Xw.a{ui, . . . , m„) -^ {^{Xw.ti == Xw.ui, . . . , Xw.tn — Xw.Un], [ ]) 

Flex-rigid: 

Xw.X(x) ^ Xw.a{Xyi.ui,. . . ^Xym.Um) ^ ({ Xw.Xy{.Xi{z{) ^ Xw.Xy{.ui , 

Xw.Xy7n.Xm{Zm) - Xw.XyTn.Um}, 

[X ^ Xx.a{Xyl.Xi{zl), . . . ,XyZi.X.^{z':'m))\) 
ii X ^ FV{ui), a is a constant or a G {x}, 
and {zi} — {x} U {yl}, for « = 1, . . . , to. 
Flex-flex: 

Xw.X{x) ^ Xw.X{y) -^ (0, [X ^ Xx.Z{z)]) 

where {z} = {xi \ Xi = j/J 

Xw.X{x) = Xw.Y{y) -^ (0, [X ^ Xx.Z{z),Y ^ Xy . Z{z)]) 
where X y^Y and {z} = {x} f) {y} 

These transformations are applied as follows. The equation on the left-hand side 
is replaced by the equations in the first component of the right-hand side, and then 
the substitution in the second component of the right-hand side is applied to all 
the equations. If this substitution introduces j3-redexes, they are removed using 
swappings, according to Lemma 6.5. Moreover, all the substitutions are composed 
to compute the resulting unifier. In other words, the transformation is applied 
as follows ({e} U E,a) — > {(^'{E' U E) i/3,cr' o a), if we have a transformation 
e^{E',a'). 

With the following examples, we illustrate how these rules solve the problems 
concerning the introduction of new bound variable names described previously, at 
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the beginning of this section. 

Example 6.7. Given the equation Xx.Xy.Xy.X{x,y) = \y.Xy.Xx.Y{x,y) the 
apphcation of the first a-transforniation rule gives us Xx.Xy.Xy.X(x,y) = 
Xx.Xx.Xy.Y(y,x). A second apphcation of this a-transformation gives us 
Xx.Xy.Xy.X(x,y) = Xx.Xy.Xx.Y{x,y). Now, the first a-transformation rule is 
no longer applicable. However, we can apply the third a-transformation rule, 
that instantiates [X i— > Xx.Xy.X'{y)], and gives the equation Xx.Xy.Xy.X'{y) = 
Xx.Xy.Xx.Y{x,y). Now, applying the second a-transformation rule, we obtain 
Xy .Xy .X' [y) = Xy.Xx.Y{x, y). Again, we can apply the third a-transformation rule, 
that instantiates \Y h-> Xx.Xy.Y'{x)\, and gives Xy.Xy.X'{y) = Xy.Xx.Y'{x). The 
first a-transformation rule gives Xy.Xy.X'ijj) = Xy.Xy.Y' {y). Finally, the second 
a-transformation rule gives Xy.X'{y) = Xy.Y'{y). 

This last equation can be solved applying the second flex-flex rule. The resulting 
unifier is 



[X' ^ Xy.Z(y), Y' ^ Xy.Z(y)] o [Y ^ Xx.Xy.Y'{x)] o [X <^ Xx.Xy.X'{y)] 
= [X ^ Xx.Xy.Z(y), Y ^ Xx.Xy.Z(x)] 



{X.Y} 



Example 6.8. The new flex-rigid rule transforms Xx.X(x) = Xx.f(Xy.a) 
into the equation Xx.Xy.Xi{x,y) = Xx.Xy.a and the substitution [X i—> 
Xx.f{Xy.Xi{x,y))]. The original flex-rigid rule would give us Xx.Xi{x) = Xx.Xy.a, 
that conveniently 77-expanded using the same variable name y, results into the 
same equation. A further application of the flex-rigid rule solves the equation by 
[Xi n> Xx.Xy.a]. 

In other cases, the resulting equation may be different. The new rule transforms 
Xx.X{x) ^ Xx.f{Xx.g{x)) into the equation Xx.Xx.Xi{x) == Xx.Xx.g{x) and the 
substitution [X i— > Xx.f{Xx.Xi{x))]. However, the original flex-rigid rule would 
give us Xx.Xi{x) = Xx.Xx.g{x) and the substitution \X i-t- Xx.f{Xi{x))]. In the 
subsequent ry-expansion we can not use the name x, and we need a new name z, 
and a-conversion of the right-hand side getting Xx.Xz.Xi{x, z) — Xx.Xz.g{z). Both 
equations are obviously distinct. However, to solve this second equation, Xi can 
not use the first argument, because it is not used in the right-hand side. Therefore, 
we can instantiate Xi 1— > Xx.Xy.X[{y), and a-convert the new variable name z, 
getting the same equation as with the new hex-rigid rule. 

Example 6.9. Given the equations {Xx.Xy.X{x,y) = Xx.Xy.f{Xx.Y{x,y)), 
Xx.Xy.Z{x,y) = Xx.Xy.X{y,x)j , after solving the first equation and replac- 
ing \X H- > Xx.Xy.f{Xx.Y{x,y))] into the second one, we get Xx.Xy.Z{x,y) = 
Xx .Xy .({Xx .Xy .f {Xx.Y {x, y))){y , x)) . By Lemma 6.5, we can /3-reduce using swap- 
pings, instead of the usual standard substitution. The permutation will be 
^2{{x,y),{y,x)) = Ui{{{xy)-y),{x))-{xy) = {xx)-(xy) = (xy), and the result 
of the /3-rcduction will be 

{Xx.Xy.f{Xx.Yix,y))){y,x) ^p {x y) ■ f (Xx .Y {x , y)) - f {Xy .Y {y , x)) 

Lemma 6.10. The algorithm described in Definition 6.6 is sound and complete 
and computes a m,ost- general higher- order pattern unifier whenever it exists, when 
names oj free and hound variables are disjoint. 
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Proof. The algorithm computes basically the same most general unifiers than 
the Nipkow's algorithm. 

The fact that we use swapping instead of substitution to remove /3-redexes is not 
a problem according to Lemma 6.5. We will obtain a term that is a-equivalent to 
the one that we would obtain with the traditional capture-avoiding substitution. 
Notice that in the lemma we require arguments of free variables (the sequence y) 
to be a list of distinct bound variables. This is ensured in the case of higher-order 
pattern unification, but it is not true in the general A-calculus. The algorithm 
preserves the disjointness of bound and free variable names. Therefore, the other 
condition of the lemma {y] fl FV(Aa;.i) is also satisfied. 

In the third a-transformation rule, if a; ^ FV(f) and x G FV(u) and the equation 
is solvable, then x must occur in u just below a free variable, as one of its argu- 
ments, and this free variable must be instantiated by a term that does not use this 
argument. Notice also that the three a-transformation rules, when the equation 
is solvable, succeed in making the lists of most external A-bindings equal in both 
sides of the equation. 

In the case of the flex- rigid rule, we may obtain an equation \x.Xi{xi, . . . , x„) = 
\x.\y.u[ that needs to be 77-expanded, and where {xi, . . . ,x„} n {y\ ^ 0. Let 
be {x\, . . . ,x'^i} — {xi^ . . . ,Xn} \ y, i-e. the sequence of variables xis not in 
y. In any solution of this equation Xi can not use the variables of the inter- 
section of {xi, . . . ,x„} n {y}. Therefore, we can extend the solution with Xi h- > 
\xi , . . . Xn-^y-X[{x'i , . . . , x^, j7), and get the equation \x.Xy.X[{x'i , . . . , x'^t , y) = 
\x.\y.u[. 

The flex-flex and rigid-rigid rules are the same as in Nipkow's algorithm. D 

Lemma 6.11. Let P be a solvable pattern unification problem, where the set of 
free and bound variable names are disjoint, and let (ai,...,a„) be a list of the 
names of bound variables of the problem. Then, there exists a most general unifier 
a such that 

(1) (7 does not use other bound-variable names than the ones already used in the 
problem, i.e than {ai, . . . , a„}. 

// in the original problem all bound variables with the same name have the same 
type, i.e. we have a type Ti for every bound variable name a^, then 

{2) the same applies to a, i.e. any bound variable of a with name Oi has type Ti, 
and 

(3) any free variable X occurring in a has type vi — > • • • — > Vm — > v, where 
iyi, . . . , Vm) is a sublist 0/ (ti, . . . , t„). 

Proof. By Lemma 6.10 with the new transformation rules we obtain most gen- 
eral unifiers for solvable pattern unification problems. Then, by simple inspection 
of the new transformation rules, where all bound variable names in the right-hand 
sides of the rules are already present in the left-hand sides, we have that new equa- 
tions and substitutions do not introduce new names. In addition, since names of 
free and bound variables are distinct, /3- reductions due to substitution applications 
satisfy conditions of Lemma 6.5, therefore we can conclude that we do not need 
new bound variable names due to /3-reductions either. 
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Notice also that in these rules, when we introduce a new bound variable in the 
right-hand side, with a name already used in the left-hand side, both variables have 
the same type. When, we swap two variable names in an a-conversion or in a 
/3-reduction, they have also the same type. 

Finally, let a' be any most general unifier not using other bound variable 
names than the ones used in P, i.e. ai,...,a„. For every variable X oc- 
curring free in a^ chose one of their occurrences. This will be of the form 
X(6i, . . . , 6m), where {6i,...,fem} C {ai,...,a„} and the 6i's are pairwise dis- 
tinct. Let (b^/i), . . . , &7r(m)) be a sublist of (ai, . . . , a„). Then composing a' with 
[X' K^ Xbi. ■ ■ ■ .A6m.X(6^(-i), . . . , 67r(m))]j for every variable X, we get another most 
general unifier fulfilling the requirements of the third statement of the lemma. No- 
tice that, although not all occurrences of X have the same parameters, it does not 
matter which one we chose because all them have the same type. D 

7. THE REVERSE TRANSLATION 

As we have shown. Theorem 5.13 is not enough to prove that, if |P] is solvable, 
then P is solvable. We still have to prove that if |P] is solvable, then for some 
solution (T of |P] we can build a nominal solution (V, a') of P. This is the main 
objective of this section. Taking into account that |P] is a higher-order pattern 
unification problem, and that these problems are unitary, we will prove something 
stronger: if |P] is solvable, then |(t]~ is defined for the most general unifier a of 
|P]. Moreover, in the next section we will prove that |cr]~ is also a most general 
nominal unifier. 

Definition 7.1. Let (ai, . . . , an) be a fixed ordered list of atoms, and let V be 
a freshness environment. The back-translation function is defined on X-terms in 
rj-long ^-normal form as follows: 



\) 



where n is a permutation on (ai, . . . , an) satisfying 
(tt-Ci, . . . , TT-Cm) is the sublist of (ai, . . . , an) such that 
7r-Ci#X ^ V and C\ and tt-Cj have the same sort 

where a is a bound variable with name a, / is the constant associated to the function 
symbol f , either X is the free variable associated to X, or if X is a fresh variable 
then X is a fresh nominal variable, and the permutation tt^^ is supposed to be 
decomposed in terms of transpositions (swappings). 

Notice that the back-translation function is not defined for all A-terms, even for 
all higher-order patterns. In particular, |Aa:;.i] is not defined when x is not base 
typed, or |a:;(ti, . . . , i„)] is not defined when a; is a bound variable. 

Notice also that the permutation tt is not completely determined by the side 
condition of the forth equation. For instance, given (ai, a2, as) as the list of atoms, 
all them of the same sort, to define |X(ai)]~ r^ Jtx a #x} ~ tt^^ -X the condition 
requires tt-qi = 03, but then, we can choose 7r-a2 = ai and ir-a^ — 02, or vice versa 
TT-a2 = 02 and n-a^, = ai. Therefore, Jt] y is nondetcrministically defined. 
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For A-substitutions the back-translation is defined as follows. 

Definition 7.2. Let (ai, . . . , an) be a fixed ordered list of atoms, and let \7 be a 
freshness environment. The back-translation function is defined on \- substitutions 
as follows. 



XeDom(cr) 



, • • • ,"ny 



Notice that if a{X){ai, . . . ,an) is not a well-typed A-term, or 
|cr(X)(ai, . . . ,a„)]y is not defined for some X G Dom(CT), then Icr]^ is 
not defined. 

We introduce the following notion to describe which A-terms and substitutions 
have reverse translation w.r.t. a freshness environment. 

Definition 7.3. Given a X-term t (resp. \- substitution a), and a freshness en- 
vironment V, we say that t (resp. a) is ^-compatible if Ji] ^ (resp. l^aj ^) is 
defined. 

Lemma 7.4. For any X-term t, and freshness environment V, if t is V- 
compatible, then \t\ y = t. 
For every X- substitution a , and freshness environment V, if a is V -compatible, then 



Proof. Let (ai, . . . , ap) be a fixed ordered list of atoms. The existence of |i] ^ 
restricts the form of t to five cases. For the first four, the proof is trivial. In the 
case t — X{ci, • • • , Cm), we have 



I^(ci, 



' ' ' 1 ^n 



= X{ci,- ■ ■ ,Cm) 



IT -TT-Cn 



where tt is a permutation on (ai, . . . , a„) satisfying (tt-ci, . . . , 7r-Cm) is the sublist of 
(ai, . . . , ap) such that tt-Cj^^X ^ V and c; and tt-c; have the same sort. 
For the second statement, by Definitions 7.2 and 5.8 we have 



U [X^Ia(X)(ai,---,a„) 

XeDom(<T) 



-U V 



|cr(X)(ai,---,a„) 



= 1) \x ^^ Afli ■ ■ ■ a. 

XeDom(<T) 

= [J [X h^ Xai- ■ ■an.a{X){ai,- ■ ■ ,an)] 

XgDom((T) 

[J [X ^ a{X)] ^ a 

XeDom(<j) 

Where we make use of the first statement to prove |cr(X)(ai, 

a{X){ai, • • • , a„). Notice that, if a is V-compatible, then a{X){ai, • • • , a„) is also 
V-compatible. D 
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Given a pattern unifier, in order to reconstruct the corresponding nominal uni- 
fier, we tiave several degrees of freedom. We start with higher-order pattern unifier 
(7 with a restricted use of names of bound variables. Then, we will construct a 
freshness environment V such that a is V-compatible. This construction is de- 
scribed in the proof of Lemma 7.6, and it is nondeterministic. The corresponding 
nominal solution is then (V, l^aj y). Moreover, |cr] ^ is not uniquely defined. 
The following example illustrates these degrees of freedom in this back-translation. 

Example 7.5. The nominal unification problem 

P = {a.a.X « c.a.X , a.b.X « b.a.(a b)-X} 

where all atoms and variables are of the same sort, is translated as 

JP] = { Xa.Xb.Xc.Xa.Xa.X{a,b,c) = Xa.Xb.Xc.Xc.Xa.X{a,b,c) , 
Xa.Xb.Xc.Xa.Xb.X{a,b,c) ~ Xa.Xb.Xc.Xb.Xa.X {b, a, c) } 

Most general higher-order pattern unifiers are 

ai = [X ^ Xa.Xb.Xc.Z{a,b)] 

and 

(72 = [^ ^ Xa.Xb.Xc.Z{b,a)] 

which are equivalent. 

Let (a, b, c) be the fixed list of atoms. Following the construction described in the 
forthcoming proof of Lemma 7.6, for every variable Z occurring in a, we construct 
a sublist of atoms Lz = (bi, . . . , bm) satisfying bj : |rj] , for every j = 1, . . . , m. 
In our case, we can choose among three possibilities L^ = (a,b), L^ = (a,c) or 
Li = (b,c). We construct V = I I „ {a#Z}. 



Z occurs in a 
a e (ai, . . . ,a„) \ Lz 



From the two pattern unifiers Ci's, and the three lists L'^'s we can construct six 
possible nominal unifiers: 







(Tl 




a-2 




L'z 


({c#Z} , 
({b#Z} , 
({a#Z} , 


\hcaj 


•Z]) ({c#Z}, 
•Z]) ({b#Z} , 
•Z]) ({a#Z} , 


ix^(L-y 


'•z]) 

'•z]) 

•z]) 



The permutations can be written as swappings obtaining: 







CTi 


0-2 


L'z 


({c#Z} , 
({b#Z} 
({a#Z} , 


[X^Z]) ({c#Z}, 
[X^(bc).Z]) ({b#Z} 
[X^(ac)(bc).Z]) ({a#Z}, 


[X^(ab).Z]) 

[X^(ab)(bc).Z]) 

[X^(ac).Z]) 



All these nominal unifiers arc most general and equivalent. Notice that these are 
all the most general nominal unifiers. 
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Lemma 7.6. For every equational nominal unification problem P, ij the pattern 
unification problem JP] is solvable, then there exists a freshness environment V, 
and a most general pattern unifier a, such that a is V -compatible. 

Proof. The most general unifier a is chosen, accordingly to Lemma 6.11, as a 
unifier not using other bound variable names than the ones used in |P]. Moreover, 
since all bound variables of |P] with the same name a, have the same type r^, the 
same happens in a, and all free variables Z occurring in a have a type of the form 
Z : Til —;■... —^ Ti^ — > 6, for some indexes satisfying 1 < ii < ■ ■ ■ < im < n. Notice 
that there could be more than one set of indexes satisfying this condition. 

The freshness environment V is constructed as follows. For any variable Z : 
Ti-^ —>■... Tj^ — > 5 occurring® in a , let Lz = (aij , . . . , a;^) be a sublist of the atoms 
(ai,...,an). Then, 

V = U {a#Z} 

Z occurs in a 
a e (ai, . . . ,3„)\Lz 

We prove that (j{X){ai, . . . , a„) is V-compatible, for any X G Dom(cr). 

Since a is most general Dom(cr) only contains variables X of |P]. All these 
variables have type |ti] ^ • • ■ — > Jt„] — ^ Jro], where (n, . . . , t„) is the list of sorts 
of (ai, . . . , an), and tq is the sort of X. Therefore, a{X){ai, . . . , a„) is a well-typed 
A-term. Now we prove that this term is back-translatable by structural induction. 

By Lemma 6.11, ct{X) does not use bound variables with other names and types 
than the ones already used in the original problem. This ensures that we can always 
translate back bound variables a as the atom with the same name a. Terms formed 
by a constant or free variable are particular cases of applications with ?n = 0, 
studied bellow. 

All A-abstractions will be of the form Aa^.t, where Oi = [[a;]. This ensure that its 
translation back is possible, if the body of the A-abstractions is back-translatable. 

All applications are of the form /(ti, . . . , tm) where / is a constant of the original 
nominal problem (since a is most general) , or of the form A" (a^j^ , . . . , a^^ ) where X 
is a free variable and a^^ , . . . , a^^ are distinct bound variables. Notice that we can 
no have terms of the form ai(ti, . . . ,t„) where ai is a bound variable, because all 
these bound variables have basic types. In the first case, the application is back- 
translatable if arguments are. In the second case, let X : tj-^ — > . . . — ;> tj^ — > 5, for 
some indexes satisfying 1 < ji < ■ ■ ■ < jm ^ n. using the V constructed before, 
we can translate back A(aij^, . . . , a^^) as tt^-'-'X, for some tt satisfying 7r(ai|^) = aj^, 
for J = 1, . . . , m. Notice that aj,^ and ai^ have the same sort tj,, . Hence, this second 
kind of applications is also back-translatable. D 

Theorem 7.7. For every equational nominal unification problem P , if the pat- 
tern unification problem |P] is solvable, then P is also solvable. 

Proof. By Lemma 7.6, if JP] is solvable then there exist a most general uni- 
fier a of JP], and a freshness environment V such that (V, Jcr] y) is defined. 
W.l.o.g. assume that Dom(cr) — Vars(JP]) and hence, according to Definition 7.2, 



^We say that X occurs in a, if X occurs free in criY), for some Y G Doni(cr). 
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= cr, which solves 



Dom(|(T] y) — Vars(P). By Lemma 7.4, we have \\a\\ 
|P]. Hence, by Theorem 5.13, (V, IcrF^y) solves P. D 

From Theorems 5.15 and 7.7, and hnear-time decidability for Higher-Order Pat- 
terns Unification [Qian 1996], we conclude the following results. 

Corollary 7.8. Nominal Unification is quadratic reducible to Higher-Order 
Pattern Unification. 
Nominal Unification can be decided in quadratic deterministic time. 

8. CORRESPONDENCE BETWEEN UNIFIERS 

In this section we establish a correspondence between the solutions of a nominal 
unification problem and their translations. We prove that the translation function 
is monotone, in the sense that it translates more general nominal solutions into 
more general pattern solutions. The reverse translation also satisfies this property. 
Therefore, both translate most general solutions into most general solutions. We 
start by generalizing the translation of a nominal substitution w.r.t. a freshness en- 
vironment, to respect the translation of a nominal substitution w.r.t. two freshness 
environments, and similarly for the reverse translation. 

Definition 8.1. Let (ai, . . . , an) be a fixed list of atoms. 

Given a nominal .substitution a, and two freshness environments V and V', sat- 
isfying V h (t(V'), we define 

XeDom(CT) 

where (bi,...,bm) = (a £ (ai,...,ap) | a#X^ V). 

Given a pattern substitution a , and two freshness environments V and V , we 
define 

H^'v- U [y<^Mx)ibi,...,bm)r\' 

XeDom(CT) 

where (bi,...,bni) = (a £ (ai,...,an) | a#X^ V). 

We say that a is 'V' ^ ^-compatible if ^aj y exists. 

Notice that this definition generalizes Definition 5.8 because Jcrjy = |cr]y, and 

-1 -i" 

Definition 7.2 because, |cr] v = M v 

The following lemmas are generalizations of Lemmas 5.11 and 7.4, respectively. 

Their proofs are also straightforward generalizations. 

Lemma 8.2. For any nominal substitution a, freshness environments Vi and 
V2, and nominal term t, satisfying V2 ^ ""(Vi) and Vars(t) C Dom{a), we have 

Wv^(WvJ = Kt)lv. 

Lemma 8.3. For any \- substitution a and freshness environment \I i and\72, if 
CT Z5 Vi — > V 2- compatible, then 
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If a A-substitution cti is more general than another CT2, then there exists a sub- 
stitution (J3 that satisfies 02 — oj, ° <J\. The foUowing lemma states that this 
substitution can be used to construct a nominal substitution [(Ts] that we will 
use, in Lemma 8.5, to prove that |cri] is more general than |(72] 

Lemma 8.4. For any 'pair of \- substitutions cti and G2 and freshness environ- 
ments Vi and V2, if ui is Vi- compatible, ui is V 2 -compatible, and a\ is more 
general than 02, then there exists a \- substitution a-j, such that 

(^) CT3 is Vi — > '^ 2- compatible, and 

(3) S72hla,r'lli^i)- 

Proof. The first conclusion is a consequence of Ci is more general than (72. 
However, w.l.o.g. we take a CT3 that only instantiates variables occurring in ai or 
belonging to Dom(cr2)- 

For all X e Dom(cr3), let (bi,...,bm) = (aj | aj^X ^ Vi), where (ai,...,an) 
is the fixed list of atom names. Now, X occurs in cti or X e Dom(cr2)- In the 
first case, since ai is Vi-compatible and we are dealing with higher-order pattern 
substitutions, X occurs in ai in (at least one) subtcrm of the form X{bi, . . . , 6^), 
where b[ are distinct bound variables with names in (ai,...,an), and bi and 6^ 
have the same type. Moreover, a-3{X){b[, . . . ,b[j^), conveniently /3-reduced, is a 
subtcrm of some (72 (^), for some Y G Dom((72). In the second case, if X G 
Dom(cr2)j we also have this property. Therefore, since CT2 is V2-compatible, we have 
that a3{X){b[, . . . , b'„J, and hence a-3{X){bi, . . . , bm) is V2-compatible. Therefore, 

ksrVs = UxeDom(^3)[X ^ k3{X){bi,...,brn)r^] cxists, and 0-3 is Vi -^ V2- 
compatible. 

Let be b^^X G Vi. The free variable names of ^^{X) and (ai,...,ar,) 
are disjoint. Therefore, b ^ FV{a^{X){bi, . . . ,b„i)), where (bi,...,bm) = 
(ai I ai#X ^ Vi). By Lemma 7.4, since a3{X){bi, . . . ,bm) is V2- 

'la3(X)(6i,...,6„0r'v, 



compatible, we have b ^ FV 



By Lemma 5.10, 



V2hb#Ia3(^)(&i,.-.,&m)l 'v.- By Definition 8.1, V2hb# 1^31 'lliX). There- 
fore, we have V2 \- lo-^}^ ^ (^i)- □ 

The following lemma ensures that the translation and reverse translation of sub- 
stitutions is monotone w.r.t. the more generality relation. 

Lemma 8.5. For every nominal unification problem P and pair of unifiers 
(Vi,(Ti) and (^2,(^2), satisfying Vars(P) C Dom(cri) C Dom((T2)7 we have (Vi,CTi) 
is more general than (V2,(T2), if, and only if, Jcrijy is more general than [[fT2]Y, . 

Proof. =>) By Theorem 5.13, both [[cri]y and [[cr2lv ^^^ solutions of JP]. If 
(Vi,cri) is more general than (V2,(T2), then there exists a nominal substitution 
a' such that V2 h cr'(Vi) and V2 h ct' o cri|Dom(<Ti)uDom(o-2) ~ f^i- For all X G 
Dom(cr2), we have V2 h a'{ai{X)) « a2{X). By Lemma 5.10, K(o-i(X))lv2 =" 
Ia2(X)lv,. By Lemma 8.2, |a'gj(Iai(X)lvJ =a ^20^)1^,- % Lemma 5.11, 
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Kly:(I^ilv.(IXl0)) =" Mv.dXle). Since IXI0 = X(ai,...,a„) and a, will be 
distinct free variables, we have 

Ia2lv,(^) = Kg^ o lail^^(X), for all X e BomHa^hJ 

Therefore, Icijy is more general than |cr2]y . 

'^) There exists a A-substitution a' such that IcraJv^ = cr'° [c^ilvi lDom(<Ti)uDom(CT2)- 

By Lemma 8.4, a' is Vi — > V2-compatible. Hence, it exists the nominal substi- 

tution <t" = |(t'] y . For any X G Dom(cr2), by Lemmas 8.2 and 8.3, we have 

[[cr2(X)lv2- By Lemma 5.10, we have V2 h cr"(cri(X)) « cr2(X). Therefore, 
V2 I- cr" o cri|Dom(CTi)uDom(<T2) ~ (^i- By Lemma 8.4, we also have V2 h cr"(Vi). 
From both facts, we conclude that cti is more general than (72 ■ □ 

Corollary 8.6. Most general nominal unifiers are unique. 

Proof. It is a direct consequence of uniqueness of most general higher-order 
pattern unifiers and Lemma 8.5. D 

Finally we can conclude that the translations preserve most generality. 

Theorem 8.7. For any nominal problem P and nominal solution (VjCr), satis- 
fying Vars(P) C Dom((T), (V,(t) is a most general unifier if, and only if Jcrjy is a 
most general unifier of |P] . 

Proof. ^) Suppose that (V,i7) is a most general nominal unifier of P, but 
Io-Jy is not a most general pattern unifier of JP]. By Theorem 5.13, Icrjy is a 
solution of |P]. Since most general higher-order pattern unifiers are unique, and 
by Lemma 7.6, there exists a most general pattern unifier a' of |P] strictly more 

general than Icrjy and such that la'}^ exists. By Lemma 7.4, |cr']~ ~ cr' . 
Since we assume that (V, a) is most general and nominal most general unifiers are 
also unique, we have that (V, cr) is more general than Jcr'] . Hence, by Lemma 8.5, 
|a]y is more general than |(t'] = a' , which contradicts that cr' is strictly more 

general than |a-]y. 

<^) Suppose that fcrj.^ is most general, and (V,cr) is not. Then, there exists 
a most general unifier (V',(j') such that (V,i7) is not more general than (V',cr'). 
On the other hand, since |cr]y is most general, it is more general than |cr']y, . 
Hence, by Lemma 8.5, (V,cr) is more general than (V',cr'). This contradicts the 
initial assumption. Therefore, if Jcrjy is most general, then (V,cr) must be most 
general. D 

9. CONCLUSIONS 

The paper describes a precise quadratic reduction from Nominal Unification to 
Higher-Order Pattern Unification. This helps to better understand the semantics 
of the nominal binding and permutations in comparison with A-binding and a- 
conversion. Moreover, using the result of linear time decidability for Higher-Order 
Patterns Unification [Qian 1996], we prove that Nominal Unification can be decided 
in quadratic time. 
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